The Journal of Molecular Diagnostics
○ Elsevier BV
Preprints posted in the last 90 days, ranked by how well they match The Journal of Molecular Diagnostics's content profile, based on 36 papers previously published here. The average preprint has a 0.02% match score for this journal, so anything above that is already an above-average fit.
Filipovic-Sadic, S.; Parker, C. A.; Mihailovic, M. K.; Milligan, J. N.; Turner, J. M.; Borel, S. L.; Le, V.; Markulin, T.; Janovsky, J. W.; Killinger, B. J.; Deshotel, M. J.; Reading, N. S.; Fredrickson, E. K.; Ji, Y.; Close, D.; Wright, J.; Williams, M.; Barrie, E. S.; Martin, K. E.; Gray, S. M.; Haynes, B. C.; Hall, B.
Show abstract
PurposeCarrier screening for hereditary conditions is challenged by genes with complex genomic architecture, where short-read sequencing can fail to detect clinically relevant variants. This study evaluated a unified, amplification-based nanopore sequencing workflow across multiple laboratories for comprehensive analysis of such loci. MethodsA modular long-read sequencing assay was evaluated across five laboratories using targeted PCR enrichment, Oxford Nanopore sequencing, and automated variant analysis. The workflow interrogated genes associated with spinal muscular atrophy, thalassemia, cystic fibrosis, fragile X syndrome, congenital adrenal hyperplasia, Gaucher disease, and hemophilia A. Performance was assessed against orthogonal methods for single nucleotide variants (SNVs), indels, copy-number variants, repeat expansions, and structural rearrangements. ResultsAcross 882 unique samples (1,266 tests), overall agreement with comparator methods exceeded 96% for variant-level detection and 97% for genotype status classification. Long-read sequencing enabled phasing of paralogous loci, integrated sizing and interruption analysis for FMR1 repeats, and simultaneous detection of SNVs and structural variants in globin loci and CYP21A2-TNXB region, reducing reliance on multiple workflows. ConclusionThis multisite evaluation suggests that targeted long-read sequencing can consolidate complex variant detection into a single workflow, improving analytical completeness and operational efficiency for carrier screening.
Bharne, D.; Gaston, D.
Show abstract
One of the current workhorses of next-generation sequencing in clinical molecular diagnostics laboratories for profiling somatic mutations in tumours are amplicon-based targeted sequencing panels. Many open-source somatic variant callers are available; however, their use in clinical applications remains under explored. Therefore, we integrated outputs of six variant callers (FreeBayes, MuTect2, Pisces, Platypus, VarDict and VarScan) into a Snakemake pipeline and evaluated tumour-only data from the HD789 commercial reference standard sequenced in triplicate on three different sequencing runs using the Illumina AmpliSeq Focus panel on MiSeq and NextSeq 2000. A 1:4 dilution sample was sequenced for evaluating limits of variant detection. The called variants were analysed along depth, allele frequency, and other sequencing metrics. The variant callers were evaluated by their level of concordance and performance on known somatic variants. FreeBayes consistently called the largest number of somatic variants in each sample but also included more potential artifacts. Overall, FreeBayes, VarScan, MuTect2, and Pisces had the best performance on HD789 data.
Pham, H. T.; Bussey, K. J.; Oshiro, M. M.; Rounseville, M.; Moses, M.; Zulbaran-Rojas, A.; Nguyen, V.; Bernert, R. A.; Routh, J.; Watts, G.; Block, G. D.; Fisher, W. E.; Nelson, M. A.
Show abstract
ContextPancreatic ductal adenocarcinoma (PDAC) is an aggressive malignancy often diagnosed at advanced stages due to the lack of early clinical symptoms. DNA methylation alterations arise early in PDAC tumorigenesis and may serve as promising biomarkers for blood-based cancer detection. ObjectiveTo evaluate the performance of EPISEEK, a laboratory-developed blood-based multi-cancer early detection (MCED) assay, for detecting PDAC across disease stages. DesignA retrospective cohort study included 97 patients with stage I-IV PDAC and 201 asymptomatic healthy controls. Sensitivity, specificity, area under the curve (AUC), and stage-specific performance were assessed. EPISEEK-MCED performance was also compared with CA 19-9 alone and in combination with CA 19-9. ResultsEPISEEK-MCED classified 65 of 97 PDAC cases as positive, corresponding to an observed sensitivity of 70.1% (95% CI, 60.3% - 78.3%) at 99.5% specificity. The assay demonstrated strong discrimination between PDAC cases and healthy controls, with an AUC of 0.916 (95% CI, 0.88 - 0.952). Sensitivity increased with advancing stage while remaining substantial in early-stage disease, measuring 53.6% for stage I and 65.1% for stage II PDAC, 100% for stage III and 94.7% for stage IV. Across stages, EPISEEK-MCED outperformed CA 19-9 alone, particularly in early-stage disease. Combined analysis of EPISEEK-MCED and CA 19-9 further improved detection performance, achieving sensitivity of 57.1% and 81.4% for stage I and II, respectively. ConclusionsEPISEEK-MCED demonstrated high specificity and sensitivity for PDAC detection across disease stages, including early-stage disease. Combining EPISEEK-MCED with CA19-9 further improved performance, supporting its clinical utility for PDAC detection.
Jones, T. I.; Eriksen, B. Z.; Farooqi, M. N.; Gould, T.; Jones, P. L.; King, O. D.
Show abstract
BackgroundFacioscapulohumeral muscular dystrophy (FSHD) is caused by epigenetic dysregulation at the chromosome 4q35 D4Z4 repeat array under specific permissive genetic conditions. Due to the complexity, expense, and general inaccessibility of FSHD genetic testing, many individuals displaying characteristic muscle weakness are never genetically confirmed and at-risk relatives cannot get screened. We previously developed a targeted bisulfite sequencing (BSS) protocol using the Sanger method to determine DNA methylation levels at specific D4Z4 loci relevant to distinguishing forms of FSHD from non-FSHD that can be used with DNA isolated from saliva, thereby reducing cost and increasing accessibility compared to traditional D4Z4 deletion testing that uses DNA isolated from blood. MethodsHere, we adapt the D4Z4 BSS protocol to next-generation sequencing (NGS) to increase sequencing depth and further reduce cost, validate both sequencing technologies against several cohorts of genetically defined samples, and introduce the D4Z4caster software for computing DNA methylation signatures with diagnostic utility from raw sequencing data. ResultsBoth Sanger and NGS BSS methods using D4Z4caster were validated as providing high sensitivity and specificity, with geometric mean of sensitivity and specificity (G-mean) >95% and area-under-the ROC curve (AUC) of 0.99. The NGS method allows for higher throughput and increased read depth, while the Sanger method allows faster processing of individual samples. Importantly, the NGS method could identify FSHD1 cases that are likely mosaic and would otherwise be missed. ConclusionsD4Z4caster methylation signatures can accurately detect contracted FSHD1-permissive chromosome 4q35 alleles, hypomethylation of D4Z4 arrays indicative of FSHD2, and SNPs that are important for diagnostic use. This workflow is amenable to transitioning to clinical settings for an accurate, low-cost FSHD molecular diagnostic test that could be accessible worldwide. What is already known on this topicCurrently accepted genetic diagnostics for FSHD1 are complex and expensive and can mischaracterize certain complex genetic cases. These diagnostics all require high molecular weight genomic DNA typically freshly isolated from blood, highly specialized equipment, and additional testing for FSHD2, making FSHD diagnostics the most expensive among neuromuscular diseases and inaccessible to much of the world. However, the epigenetic status of the 4q35 and 10q26 D4Z4 repeat arrays, as determined by DNA methylation status using our bisulfite sequencing-based protocol, distinguishes genetically FSHD1, FSHD2, and non-FSHD samples. Additionally, since our protocol is PCR-based, it can utilize DNA isolated from multiple sources, including saliva and buccal swabs. What this study addsThis study validates the relevant DNA methylation signatures against several large cohorts of genetically-confirmed FSHD and non-FSHD samples and optimizes the DNA methylation data analysis for the greater accuracy required for diagnostic utility, including the exclusion of nonpathogenic chromosome 10q or 4A166 contractions. In addition, we introduce the D4Z4caster analysis software, which runs in a portable and scalable Docker container, and provides increased quantitative accuracy important for: 1) confirming likely clinical cases of FSHD that do not meet the currently accepted genetic definition of FSHD1 or FSHD2, 2) identifying FSHD1 somatic mosaicism, and 3) potential prognostic applications. How this study might affect research, practice or policyFSHD1 is genetically defined by a D4Z4 array at the 4q35 locus that is contracted to 1-10 repeat units. However, disease penetrance is influenced by repeat number, epigenetic modifications, and genetic background, causing a misalignment of current genetic diagnosis with clinical diagnosis. This study will improve the accuracy of epigenetic analysis for determining cases of genetic FSHD, help broaden the definition of genetic FSHD to more accurately correspond to clinical FSHD, and allow identification of those at risk for developing clinical FSHD in affected families and in large population studies now being performed and proposed. In addition, it will better inform how an individuals epigenetic status is interpreted for potential prognostic value. Overall, this methodology is: 1) significantly less expensive than current clinically-approved FSHD diagnostic technologies, 2) more accessible due to compatibility with DNA isolated from multiple sources including saliva, and 3) compatible with the current sequencing equipment and workflow for DNA isolation used in commercial clinical laboratories. Together, these advantages will help move the technology toward becoming an approved molecular diagnostic test for FSHD in the USA, Europe, and countries currently lacking clear access to testing.
Halldorsson, S.; Nagymihaly, R. M.; Bope, C. D.; Lund-Iversen, M.; Niehusmann, P.; Lien-Dahl, T.; Pahnke, J.; Bruning, T.; Kongelf, G.; Patel, A.; Sahm, F.; Euskirchen, P.; Leske, H.; Vik-Mo, E. O.
Show abstract
BackgroundClassification of central nervous system (CNS) tumors has become increasingly complex over the past decade, raising concerns about the availability, feasibility and sustainability of comprehensive molecular diagnostics. We have evaluated nanopore whole genome sequencing (nWGS) as a single workflow to replace multiple diagnostic assays. MethodsWe performed nWGS on DNA extracted from 90 adult CNS tumor samples (58 retrospective, 32 prospective) and compared the results to findings from standard of care (SoC) diagnostic work-up. Analysis was done through an automated workflow that consolidated diagnostically and therapeutically relevant genomic alterations, including copy-number variation, structural, and single-nucleotide variants, chromosomal aberrations, gene fusions and methylation-based classification. ResultsNanopore WGS enabled final diagnostic classification in all samples with >15% tumor cell content, requiring [~]3 hours of hands-on library preparation, parallel sample processing, and sequencing times within 72 hours. Methylation-based classification was available within 1 hour and was concordant with the integrated final diagnosis in 89% of cases (80/90). All diagnostically relevant copy-number variations, single-nucleotide variants, and gene fusions were concordant with standard-of-care testing, and MGMT promoter methylation status matched in 94% of cases. In addition, nWGS identified prognostic and potentially actionable variants that were not reported or covered by SoC. ConclusionsNanopore WGS delivers comprehensive genetic and epigenetic results with a fast turn-around compared to standard methods. This enables efficient, accurate, and scalable molecular diagnostics of CNS tumors using a single platform. Its broad applicability supports its implementation in routine clinical practice and may be extended to other cancer types requiring complex genomic profiling.
Ahn, J.; Zack, D.; Zhang, P.
Show abstract
Accurate detection of RNA splice variants is often hindered when transcripts lack large distinguishable exonic regions, making conventional PCR strategies challenging. We developed a simple melting temperature (Tm)-guided exon-exon junction (EEJ) RT-PCR method to enable variant-specific detection under these conditions. Uni-directional primers spanning exon-exon junctions were designed so that approximately each half anneals to adjacent exons. The Tm of each half-site was set >7{degrees}C below the annealing temperature, preventing stable binding to individual exons and enforcing junction-dependent amplification. The method was evaluated using HTRA1-AS1 long noncoding RNA variants that share overlapping exon sequences but differ in splice connectivity. HTRA1-AS1 comprises five variants, only one with a large distinguishable exon. Tm-guided EEJ primers robustly discriminated the remaining four variants. After optimization, amplification yielded sharp, single bands with minimal cross-reactivity. Compared with conventional designs, this approach reduced heteroduplex and heteroquadruplex formation, improving band clarity. Sanger sequencing confirmed junction specificity, and the method performed well in multiplex settings. Overall, Tm-guided EEJ RT-PCR is a cost-effective, high-resolution approach for detecting RNA variants lacking easily distinguishable exonic regions, readily compatible with standard RT-PCR and qPCR workflows.
Camacho Valenzuela, J.; Pelletier, D.; Polak, P.; Fu, L.; Hamel, N.; Domecq, C.; Ahmed, A.; Robles-Espinoza, C. D.; Foulkes, W. D.
Show abstract
Purpose Patients carrying Germline Pathogenic Variants (GPVs) in multiple cancer susceptibility genes (CSGs) can be described within the context of Multi-locus Inherited Neoplasia Allele Syndrome (MINAS). The role of each GPV is typically interpreted based on clinical phenotypes. Here, we used tumor sequencing, particularly mutational signatures, to investigate the contribution of GPVs in MUTYH and PALB2 to colorectal polyposis and breast cancer in a single patient at a molecular level. Methods We analyzed tumor sequencing data, including mutational signatures and genomic scars, of a breast tumor and a colorectal polyp from a patient with biallelic GPVs in MUTYH and a heterozygous GPV in PALB2. Results The colorectal polyp showed a dominant contribution of MUTYH-associated Base Excision Repair deficiency (BERd) mutational signatures, with no evidence of Homologous Recombination Repair Deficiency (HRD). In contrast, the breast tumor showed both MUTYH-driven BERd and HRD-associated signatures, including SBS3, ID6 and an elevated HRD score, despite the absence of a detectable second hit in PALB2. These findings suggest a differential contribution from the CSGs, with MUTYH contributing to both lesions and PALB2 contributing specifically to the breast tumor. The observed pattern does not align with the additive or synergistic models described in MINAS. Conclusions Our study provides evidence that mutational signatures can elucidate the contribution of multiple CSGs to tumorigenesis within a single patient. These findings extend current interpretations of MINAS beyond additive or synergistic phenotypes, which may help to better understand tumor etiology, with potential clinical implications, including eligibility for targeted therapies.
Guerrero Quiles, C.; Lodhi, T.; Sellers, R.; Sahoo, S.; Weightman, J.; Breitwieser, W.; Sanchez Martinez, D.; Bartak, M.; Shamim, A.; Lyons, S.; Reeves, K.; Reed, R.; Hoskin, P.; West, C.; Forker, L.; Smith, T.; Bristow, R.; Wedge, D. C.; Choudhury, A.; Biolatti, L. V.
Show abstract
Whole-genome sequencing (WGS) enables comprehensive analysis of tumour genomes, but its use in formalin-fixed paraffin-embedded (FFPE) samples is limited by DNA fragmentation and low yields. Whole-genome amplification (WGA) methods such as multiple displacement amplification (MDA) can boost DNA availability but distort copy-number alteration (CNA) profiles. DNA ligation-mediated MDA (DLMDA) mitigates this bias by reconstituting fragmented templates, yet its performance in FFPE-derived DNA remains uncertain. We compared paired DLMDA pre-amplified (2h, 8h) and non-pre-amplified FFPE prostate tumour samples from 22 archival blocks (5, 15 and 20 years old). DLMDA increased DNA yield by 42- to 86-fold, with global CNA patterns largely preserved. However, DLMDA significantly reduced the number of detected CNA deletions and amplifications. These effects were independent of both block age and reaction time. CNA dropouts were randomly distributed across the genome, indicating that DLMDA does not introduce regional bias. Our results show that DLMDA enables robust DNA yield recovery and avoids false-positive CNA artefacts, but at the cost of reduced CNA sensitivity. While suitable for CNA screening pipelines through WGS, further improvements are required to minimise the false-negative risk and improve the techniques sensitivity for FFPE-based genomics.
Mavura, Y.; Crosslin, D.; Ferar, K. D.; Lawlor, J. M.; Greally, J. M.; Hindorff, L.; Jarvik, G. P.; Kalla, S.; Koenig, B. A.; Kvale, M.; Kwok, P.-Y.; Norton, M.; Plon, S. E.; Powell, B. C.; Slavotinek, A.; Thompson, M. L.; Popejoy, A. B.; Kenny, E. E.; Risch, N.
Show abstract
PurposeDiagnostic yield from exome and genome sequencing varies widely across studies. It remains unclear how much of this variation reflects patient-level factors (e.g., sex, clinical features, race/ethnicity, genetic ancestry) versus site-level practices such as sequencing modality or variant interpretation workflows. We aimed to quantify the contributions of these factors to diagnostic outcomes across five U.S. clinical sequencing sites. MethodsWe performed a cross-sectional analysis of 3,008 prenatal, neonatal, and pediatric cases from the NHGRI Clinical Sequencing Evidence-Generating Research (CSER) consortium (2017-2023). Clinical indications spanned neurodevelopmental, neurological, immunological, metabolic, craniofacial, skeletal, cardiac, prenatal, and oncologic presentations. Genetic ancestry was inferred from sequencing data, and variants were interpreted using ACMG/AMP guidelines to classify DNA-based diagnoses. Generalized linear mixed models were used to estimate associations between diagnostic yield and fixed effects (sex, prenatal status, isolated cancer, number of clinical indications, sequencing modality, race/ethnicity, and genetic ancestry), while modeling study site as a random effect to quantify between-site variation. ResultsThe overall diagnostic yield was 19.0%. Multiple clinical indications (OR=1.47, 95% CI 1.20-1.80, p<0.001) were associated with higher diagnostic yield, and male sex (OR=0.80, 95% CI 0.66-0.96, p=0.017) and prenatal status (OR=0.63, 95% CI 0.44-0.90, p=0.012) were associated with lower yield. Sequencing modality, race/ethnicity, genetic ancestry, and isolated cancer were not statistically significantly associated with diagnostic outcomes.. A model without fixed effects attributed [~]10% of variance in diagnostic yield to between-site differences. After adjusting for covariates, site-level variance decreased to 5.7%, indicating consistent variation across sites not explained by measured patient factors. ConclusionAcross five sites, patient-level clinical features influenced diagnostic yield, but substantial site-level variation remained even after adjustment. Differences in variant interpretation, or case-classification practices may contribute to this residual variability. Further efforts to increase consistency in exome- and genome-sequencing diagnostic workflows may help reduce inter-site differences.
Subhan, U.; Akram, Z.; Shafqat, S.; Younis, S.
Show abstract
Latent tuberculosis infection (LTBI) remains a significant barrier to global TB control and elimination efforts. The QuantiFERON-TB Gold (QFT) assay is commonly used for the diagnosis of LTBI. However, blood collected in QFT tubes is seldom utilized for molecular and genetic analysis due to the presence of heparin and a dense gel barrier that hinders efficient DNA extraction. To address this limitation, we aimed to develop a method for directly isolating high-quality DNA from blood in QFT tubes, eliminating the need for additional blood sampling and enabling their use in both diagnostic and molecular workflows. In this study, DNA was extracted from blood in EDTA and QFT tubes using a hybrid approach that combined manual lysis with three commercial kits: Thermo Scientific GeneJET, QIAamp DNA Blood Kit, and FavorPrep Blood Genomic DNA Extraction Kit. DNA concentration and purity were measured with a Multiskan SkyHigh Microplate Spectrophotometer, while integrity was assessed through agarose gel electrophoresis. Two nucleic acid amplification techniques (NAATs), ARMS-PCR and whole exome sequencing (WES) were performed to validate applicability of extracted DNA for molecular biology applications. We did not find any differences in the quantity, quality, or application of PCR or sequencing for DNA extracted from EDTA or QFT tubes. The extracted DNA from both EDTA and QFT tubes exhibited A260/280 ratios of 1.7-1.9 and concentrations ranging from 4.9 to 118.5 {micro}g/mL, indicating an adequate yield and purity. Intact genomic DNA and PCR product bands on agarose gel indicated suitability for downstream applications. Additionally, WES produced 6.47-8.71 GB of data per sample, with 42.8-57.7 M reads and GC content between 49.29% and 52.54%. Sequencing metrics were consistently strong, with Q20 values exceeding 98.6% and Q30 values above 95%. Our study presents an optimized and reproducible protocol for extracting high-quality DNA from QFT tubes, producing DNA suitable for both PCR and sequencing technologies. This protocol provides a cost-effective and practical strategy to integrate LTBI diagnosis with genomic research, particularly beneficial in resource-limited settings. This study introduces a novel analytical workflow applicable to diagnostic laboratory settings, enabling the integration of routine LTBI immunodiagnostic testing with downstream genomic analysis. The approach supports improved utilization of clinical specimens in laboratory medicine and may facilitate future biomarker and precision diagnostics research.
Namvar, L.; Sedov, K.; Yang, M. J.; Hermosillo, R.; Zafar, F.; Schuele, B.
Show abstract
BackgroundInduced pluripotent stem cells (iPSCs) are an important model for studying human diseases in vitro. However, previous studies have shown that iPSC reprogramming and extended cell culture can introduce genomic structural variants (SVs). Technologies like karyotyping, CNV microarrays, and whole-genome sequencing have limitations in resolution, sensitivity, or the ability to detect large and complex structural variants compared to optical genome mapping (OGM). OGM is a genome-wide structural variant detection method that analyzes fluorescently labeled ultra-high-molecular-weight DNA molecules to identify copy-number and balanced rearrangements. At sufficient coverage, OGM can detect SVs at approximately [≥]2 kbp and identify mosaic events supported by molecule-level evidence, offering higher resolution than conventional karyotyping or SNP-array-based QC. Here, we compared iPSC clones derived from peripheral blood mononuclear cells (PBMCs) and fibroblasts (FBCs) to determine whether starting somatic cell source is associated with differences in structural variant burden and SV-type profiles after nuclear reprogramming into iPSCs. ResultsWe analyzed 73 low-passage iPSC clones generated from 25 parental lines using OGM. Compared with PBMC-iPSCs, FBC-iPSCs showed higher SV burden with the enrichment of duplications [≥]100 kbp, more frequent overlap with protein-coding genes, fragile sites, and recurrent chromosomal hotspot regions. In contrast, PBMC-iPSCs showed fewer SVs overall, and a higher proportion of clones without detectable clone-specific SVs. ConclusionsOGM provides a high-resolution approach for post-reprogramming genomic quality control by detecting clone-specific structural variants at approximately [≥]2 kbp, including events below the resolution of conventional cytogenetic and SNP-array-based assays. In these early passage iPSCs, SVs overlapped protein-coding genes, fragile sites, and recurrent culture-associated chromosomal regions, underscoring the need for clone-level genomic assessment before downstream applications. FBC-derived iPSCs showed a higher SV burden, including more frequent and larger duplications, whereas PBMC-derived iPSCs more often lacked detectable clone-specific SVs. These findings suggest that PBMC-iPSCs and FBC-iPSCs can differ in post-reprogramming SV profiles and support the use of OGM as a QC strategy during iPSC generation and selection.
Mauer, C.; Reed, J. C.; Mack, A. R.; Theriault, E. A.; Tansarli, G. S.; Fang, F. C.; Bourassa, L.; Greninger, A. L.
Show abstract
Molecular syndromic panels such as the BioFire FilmArray Gastrointestinal Panel (BF-GIP) have been widely adopted for gastrointestinal illness diagnosis due to their fast turnaround times and broad pathogen coverage. Recently, the BF-GIP demonstrated increased rates of norovirus false-positive detections, prompting a Class II recall of more than two million tests in February 2024. We examined the prevalence of BF-GIP norovirus false positives across four hospitals from December 2024 to June 2025. Among 185 BF-GIP norovirus-positive results confirmed with the BD MAX Enteric Viral Panel, the false discovery rate ranged from 31 to 74% across sites, with the highest rate seen at a specialized cancer care hospital. Deep sequencing of BF-GIP pouches (n=42) confirmed the Noro-1 assay as the primary source of off-target amplification, identifying 78 off-target species, predominantly commensal stool bacteria, compared to only two species for the Noro-2 assay. Off-target species amplified by the Noro-1 assay were recovered from both false-positive and true-negative pouches, suggesting no single species accounted for the false-positive results. Partial primer complementarity at off-target loci and amplicon Tm values within the acceptable range support mispriming of gut microbiota as the underlying cause. False-positive pouches exhibited significantly higher Cp values than true positives for both assays (Noro-1: 26.6 vs. 11.1, p=0.013; Noro-2: 30.0 vs. 13.1, p<0.001), consistent with low-level off-target amplification. These findings highlight the high false discovery rate of the Noro-1 assay, identify bacterial species involved in mispriming, and demonstrate the need to redesign this assay to ensure reliable testing and improved patient care.
Warner, B. E.; Patel, J.; Satterwhite, R.; Wang, R.; Adams-Haduch, J.; Koh, W.-P.; Yuan, J.-M.; Shair, K. H. Y.
Show abstract
PurposeAntibodies to Epstein-Barr virus (EBV) proteins can predict nasopharyngeal carcinoma (NPC) risk. We previously defined a prototype EBNA1 protein panel and multiplex immunoblot assay that distinguishes NPC risk several years pre-diagnosis. Assay throughput and specificity are critical to effectively implement a population-level screening program. Here, we developed a strip test assay - EBNA1 SeroStrip-HT - with an objective to increase throughput and maximize specificity. Experimental DesignEBNA1 full-length (FL) and glycine-alanine repeat deletion mutants (dGAr) were purified from insect and mammalian cells to screen serum IgA/IgG from prospective cohorts in Singapore and Shanghai, China, with known time intervals to NPC diagnosis. Twenty pre-diagnostic sera within 4 years to diagnosis were compared to 96 healthy controls using a nested case-control study design. ResultsIgA to mammalian-derived EBNA1 dGAr achieved 85.0% sensitivity and 94.8% specificity (AUC, 0.939) for NPC status. IgA to insect-derived EBNA1 dGAr showed the same sensitivity (85.0%) and similar specificity (93.8%) (AUC, 0.941). IgA to insect-derived EBNA1 FL had a higher 90% sensitivity, but lower 91.7% specificity (AUC, 0.940). Combining EBNA1 FL and dGAr results showed that subjects positive for both proteins had a 243.67 odds ratio for NPC incidence compared to double-negative scores. ConclusionThis study demonstrated the efficacy of EBNA1 SeroStrip-HT for NPC risk assessment and stratification in high- and intermediate-risk populations, yielding high accuracy and a 12-fold increased throughput over the prototype. The insect system was appropriate for large-scale production of purified EBNA1. Larger, geographically diverse cohorts are warranted to confirm these results, especially in low-incidence populations.
Pathmendra, P.; Enguita, F. J.; Byrne, J. A.
Show abstract
Numbers of research articles studying circRNAs have increased rapidly since 2017. Previous analyses of human circRNA articles in two high impact factor cancer research journals identified papers with wrongly identified nucleotide sequence reagents and circRNAs whose identities could not be independently verified. In the present study, verification of human nucleotide sequence reagent and cell line identities in retracted circRNA articles published from 2017-2021 in high impact factor journals found wrongly identified nucleotide sequences and/or cell lines in all 13 retracted papers. Similar analyses of human circRNA papers published in high impact factor journals in 2022 found wrongly identified, non-verifiable and/or questionable reagents in 71% (84/118) papers, where 51% (60/118) papers described at least one wrongly identified reagent. When individual error types and features of concern were considered, 2022 circRNA papers described wrongly identified nucleotide sequence reagents (52/118, 44%), questionable circRNA probes that did not meet accepted targeting requirements (34/118, 29%), non-verifiable nucleotide sequences (25/118, 21%), wrongly identified cell lines (22/118, 19%), and/or non-verifiable cell line identifiers (6/118, 5%). In summary, wrongly identified, non-verifiable and/or questionable reagents were unexpectedly frequent in human circRNA papers in high impact journals, highlighting the need for critical engagement with the circRNA literature.
Maciaszek, J. L.; Pastor Loyola, V.; Cain, T.; Cardenas, M.; Blackburn, P. R.; Wilkinson, M. R.; Koo, S. C.; Wu, C.-H.; Li, C.; Wang, L.; Nichols, K. E.; Klco, J. M.; Eldomery, M. K.
Show abstract
Purpose: Pathogenic or likely pathogenic (P/LP) variants are increasingly identified in genes more commonly associated with adult-onset cancer predisposition, but their prevalence and relevance to children who present with cancer remain unclear. Methods: We retrospectively analyzed 1,280 consecutive pediatric patients with cancer who underwent clinical germline sequencing, using a virtual panel, from 2021 to 2024. Genes with P/LP variants were categorized as aoCPG or pediatric-onset cancer predisposition genes (poCPG) according to cancer risk before age 18 years and pediatric surveillance recommendations. Variant relevance was adjudicated using tumor diagnosis/histopathology, immunohistochemistry, and tumor molecular features and classified as primary, secondary, or indeterminate. Results: Among 1,280 patients, 197 (15.4%) harbored 211 P/LP variants across 54 genes. Sixty-six variants (31.3%) occurred in aoCPG, 87 (41.2%) in poCPG, and 58 (27.5%) were heterozygous variants in autosomal recessive genes. Among adult-onset variants, 7 (10.6%) were primary, 54 (81.8%) secondary, and 5 (7.6%) indeterminate. Among pediatric-onset variants, 77 (88.5%) were primary and 10 (11.5%) secondary. Six patients (3 adult-onset variants; 3 pediatric-onset variants) received targeted therapy informed by germline/somatic sequencing results. Conclusion: In pediatric oncology, most variants in aoCPG are secondary rather than tumor-related findings. Tumor-informed interpretation, beyond variant classification, may improve reporting, counseling, and therapeutic decision-making
Johnstone, J. N.; Phie, J.; Fraser, C.
Show abstract
Validation of somatic mutation burden assays is fundamentally constrained by the absence of a robust ground truth, limiting the interpretability of performance metrics. To address this, we propose a framework based primarily on relative validation, complemented by a suite of secondary metrics aligned to common failure modes. We implement this approach in SomaticCODEC, a ready-to-run assay for quantifying SNV burden in primary human samples, demonstrating strong linearity across mixtures of sperm and blood samples (R2 = 0.91) and high intra-batch precision (CV = 3.3%). This framework provides a practical approach for validating somatic mutation burden assays without requiring a ground truth.
Xiang, J.; Zhu, B.; Xu, H.; Chen, Y.; Sun, X.; xiang, r.; Zhao, Y.; Liu, W.; Zhang, L.; He, J.; liu, j.; Chen, Y.; Fan, Z.; Zhang, H.; Tan, J.; Pang, L.; Shi, L.; Kong, Y.; Cai, A.
Show abstract
Background Thalassemia is one of the most common monogenic disorders worldwide, current screening strategies combining hematological testing with molecular assays still carry a risk of missed diagnoses and undesirable efficiency, particularly for complex structural variants and rare mutations. Methods In this prospective double-blind, multicenter cohort study of 3,842 participants (3,362 pregnant women and 480 male partners), we conducted a head-to-head comparison to systematically evaluate the incremental clinical value and detection performance of single-molecule nanopore sequencing in thalassemia (SMITH) against conventional hematological testing and next-generation sequencing (NGS). Findings The overall concordance rate between NGS and SMITH was 98.6% (3789/3842). The discrepant cases (n=53) were directly attributed to the superior detection capabilities of SMITH, which successfully identified complex structural rearrangements-including 45 -globin gene triplications and four HK alleles-that were missed by NGS. Furthermore, SMITH accurately detected four rare variants (c.134_135insT/, c.-22(C>T)/, {beta}N/{beta}c.316-290delinsAGGGCAATAATTT and {beta}3.5 kb deletion/{beta}N ) and resolved ten trans and three cis configurations within the globin gene allele. Clinically, these technical advantages translated to a 9.3% (5/54) increase in the detection rate of high-risk prenatal couples, effectively preventing one birth affected by moderate-to-severe thalassemia. Additionally, SMITH corrected a diagnostic discrepancy in one case (HK vs. -3.7), sparing the couple from an unnecessary invasive procedure. Interpretation Our findings demonstrate that SMITH provides a powerful platform for resolving globin gene rearrangements, detecting rare variants, and enabling direct haplotype phasing. By effectively eliminating diagnostic blind spots, SMITH is expected to become an optimal method for thalassemia prevention programs. Funding This study was supported by Chinese National Natural Science Foundation Projects 81760037 and 82271894.
Li, Q.; Singh, A.; Hu, R.; Huang, W.; Shapiro, D. D.; Abel, E. J.; Zong, Y.
Show abstract
Although several ancillary tests are available in limited laboratories, diagnosis of microphthalmia (MiT)/TFE family translocation renal cell carcinoma (tRCC) could be challenging due to diverse and overlapping tumor morphology and the lack of reliable biomarkers. GPNMB has been recently identified as a diagnostic marker for various renal neoplasms with FLCN/TSC/mTOR-TFE alterations. However, the sensitivity and specificity of GPNMB immunostain are suboptimal and the result interpretation in ambiguous cases could be difficult. To search additional biomarkers that could improve the screening sensitivity and predict genetic aberrations in FLCN/TSC/mTOR-TFE pathway in renal tumors, we performed bioinformatic analysis of publicly available cancer databases and found GPR143, a transmembrane protein regulated by MiT transcription factors, was highly expressed in a subset of renal cell carcinomas (RCCs). In two the Cancer Genome Atlas (TCGA) kidney cancer cohorts, RCCs with high levels of GPR143 expression were enriched for renal neoplasms with FLCN/TSC/mTOR-TFE alterations. Similar to GPNMB labeling, GPR143 immunostain was positive in the majority of tRCC cases and renal tumors with FLCN/TSC/mTOR alterations, suggesting that GPR143 could function as another surrogate marker for FLCN/TSC/mTOR-TFE alterations in certain renal tumors. Interestingly, despite the concordant GPR143 and GPNMB immunoreactivity in most renal neoplasms with FLCN/TSC/mTOR-TFE alterations, diffuse GPR143 immunostain was observed in some cases with negative or focal GPNMB labeling. Taken together, our results indicate GPR143 could serve as a useful adjunct marker to improve the sensitivity for screening renal tumors with FLCN/TSC/mTOR-TFE alterations.
Gilad, O.; Drogan, C. M.; Keel, E.; Gao, G.; Swallow, C.; Govindarajan, A.; Brar, S.; Heller, M.; Apostolico, T.; Jacobs, M. F.; Gofar, K.; Dudley, B.; Karloski, E.; Lombardi, C.; Springer, M.; Saha, S.; Cox, D.; Lerner, B. A.; Hanna, G.; Chertock, Y.; Khan, A.; Ertan, S.; Hilfrank, K.; Rustgi, S. D.; Singh, A.; Hall, M. J.; Llor, X.; Bansal, A.; Patel, S. G.; Brand, R. E.; Roberts, M. E.; Stanich, P. P.; Stoffel, E.; Katona, B. W.; Aronson, M.; Kupfer, S. S.
Show abstract
Background: Gastric cancer surveillance in CDH1 pathogenic variant carriers is challenging, as predictors of localized (stage T1a) and advanced (stage >T1a) signet ring cell carcinoma (SRCC) are not well defined. We established the Group of investigAtors STriving toward Research In CDH1 (GASTRIC) consortium to identify clinicopathological factors associated with localized and advanced SRCC. Methods: A retrospective observational study (1998-2025) of CDH1 carriers across twelve academic centers was performed. Clinical, endoscopic, and pathological data were compared between carriers with and without SRCC on endoscopy, and between those with advanced versus localized or no cancer on gastrectomy specimens. Results: Overall, 390 CDH1 carriers from 235 families were included. Presence of SRCCs on endoscopy was significantly associated with thickened folds, nodularity, masses, and intestinal metaplasia, while gastritis was negatively associated. Of 196 carriers (52.4%) undergoing gastrectomy, 11 (5.6%) had advanced cancers, 10(90.9%) of which showed endoscopic abnormalities. Identification of SRCC on baseline endoscopy was the most sensitive feature for advanced disease (0.81) but had moderate specificity (0.74), whereas masses and thickened folds were highly specific (0.99 and 0.96, respectively) but less sensitive. Negative predictive values were high (0.94-1.0), while positive predictive values were modest (0.13-0.66). On multivariate analysis, masses and SRCC foci on baseline endoscopy were independent predictors of advanced disease. Conclusion: Among CDH1 carriers, absence of endoscopic findings was reassuring, whereas significance of detected endoscopic and pathological abnormalities was less certain. Advanced cancer occurred in a small number of carriers, with endoscopic abnormalities in nearly all cases. Endoscopic surveillance might be an alternative to surgery in carriers without worrisome mucosal findings.
Gross, S.; Birnbaum, R.; Shaul Lotan, N.; Mor-Shaked, H.; Manor, J.; Shaag, A.; Rosenbluh, C.; Levy-Memo, A.; Yanovsky-Dagan, S.; Saada, A.; Harel, T.
Show abstract
Background: Biallelic variants in GFM2, encoding mitochondrial elongation factor G2 (mtEFG2), a GTPase involved in the termination stage of mitochondrial translation, cause autosomal recessive combined oxidative phosphorylation deficiency. Noncoding structural variants may be missed by exome sequencing but can disrupt splicing and provide opportunities for variant-specific therapeutic rescue. We investigated the molecular mechanism underlying suspected Leigh syndrome in an infant with mitochondrial disease and evaluated whether splice-switching oligonucleotide (SSO) treatment could correct the pathogenic splicing defect. Methods: The proband underwent exome sequencing followed by short-read and long-read whole genome sequencing. RNA sequencing, reverse-transcription PCR, quantitative PCR, and cycloheximide treatment were used to characterize the effect of the identified intronic duplication on GFM2 splicing and transcript stability. Patient-derived fibroblasts were treated with SSOs targeting the aberrant splice junction. Rescue was assessed by RNA studies, western blotting, and spectrophotometric measurement of cytochrome c oxidase (COX). Results: Whole genome sequencing identified a paternally-inherited GFM2 missense variant, NM_032380.5:c.2195C>T p.(Pro732Leu), in trans to a maternally-inherited 221-nucleotide intronic duplication, NM_032380.5:c.2029-741_2029-521dup. RNA studies revealed a 87-nucleotide pseudoexon, generated by activation of a cryptic acceptor splice site within the duplicated sequence. The resulting transcript harbored a premature termination codon (PTC) and underwent nonsense-mediated decay, as confirmed by cycloheximide rescue. Together with reduced mtEFG2 protein levels on western blot, the findings supported a loss-of-function mechanism. Enzymatic analysis of affected fibroblasts showed reduced activity of the mtDNA-dependent complex IV subunit COX, with preservation of the nuclear-encoded complex II enzyme succinate dehydrogenase and the control enzyme citrate synthase, consistent with impaired mitochondrial translation. A SSO targeting the aberrant intron-pseudoexon junction nearly abolished pseudoexon inclusion, restored correctly spliced GFM2 transcript from the duplication-containing allele, increased mtEFG2 protein levels, and significantly improved COX activity. Conclusions: This study identifies a pathogenic intronic GFM2 duplication that causes mitochondrial disease through pseudoexon activation and nonsense-mediated decay. The findings demonstrate the value of integrated genome and transcriptome analysis for exome-negative mitochondrial disease and provide in-vitro proof of concept that SSOs can restore transcript processing, protein expression, and mitochondrial respiratory-chain function in patient-derived cells.